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(54) Concurrent legacy and native code execution techniques 



(57) A method and apparatus for emulating instruc- 
tions of a microprocessor ("legacy instructions") with an 
incompatible instruction set which provides increased 
throughput relative to known emulation systems. In par- 
ticular, each word of legacy memory is translated into 
an opcode/operand field and a dual function vector/tag 



field. The vector field represent addresses to legacy in- 
struction emulation routines. The tag field is indexed to 
table of "thunk" objects, which can be used for various 
purposes. Such purposes include a disabling part of the 
legacy software, augmenting the legacy software with 
native software, and gathering execution statistics with- 
out modifying the legacy software. 
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Description 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0001] The present invention relates to a method and 
apparatus for emulating legacy instructions of a micro- 
processor and more particularly to a method and appa- 
ratus which enables a legacy microprocessor to be up- 
graded with an upgrade microprocessor with an incom- 
patible instruction set and execute both legacy and na- 
tive code. 

2. Description of the Prior Art 

[0002] It is known that microprocessors are config- 
ured with different instruction set architectures (ISA). 
The ISA determines the instruction set for a particular 
microprocessor. Application programs are executed by 
the microprocessors normally written in relatively high 
level language, which is compiled into machine instruc- 
tions compatible with the instruction set for the specific 
microprocessor. Microprocessors are increasingly be- 
ing designed to execute instructions faster and faster. 
As such, systems incorporating such microprocessors 
are often upgraded to increase the speed of the system. 
Unfortunately, depending on the particular upgrade, of- 
ten times the instruction set of the upgrade microproc- 
essor is incompatible with the instruction set of the mi- 
croprocessor to be replaced ("legacy microprocessor"). 
As such, in such applications, the existing application 
programs often need to be rewritten in new and modem 
computer languages with modem compilers. Unfortu- 
nately, such an undertaking can be quite cumbersome 
and expensive. 

[0003] Due to the age and obsolescence of many ex- 
isting avionic onboard computers, the reliability of such 
computers is rapidly declining while maintenance is be- 
coming more difficult and costly to achieve. As such, it 
is sometimes required to replace outdated "legacy" mi- 
croprocessors with newer technology microprocessors. 
To work around instructions set incompatibilities, emu- 
lation systems (emulators) have been developed. Em- 
ulators are known which emulate the instructions set of 
the legacy microprocessor in orderto enable the instruc- 
tions of the legacy microprocessor to be "executed" by 
a different microprocessor. Both software and hardware 
based emulators are known. For example, various soft- 
ware emulators forthe F-1 6 avionics integration support 
facility (AISF) common modular environment (COMET) 
are described in document no. F-16AISF-COMET-100 
(EMULATORS-SWD-A, dated May 21, 1996). Hard- 
ware based emulators for military standard MIL-STD- 
1 750A, are discussed in the document entitled Line Re- 
placeable Unit Emulator Hardware Product Fabrication 
Specification, document no. SFF20702 dated April 16, 
1996. 



[0004] Unfortunately, known software emulators have 
been known to be relatively inefficient. In particular, in 
such known software emulators, legacy instructions are 
fetched for the upgrade microprocessor which uses a 

5 look up table to interpret the legacy instruction. Since 
each legacy instruction must be interpreted, computer 
systems which incorporate cache memory are known to 
suffer from relatively high probability of cache misses 
which decreases the overall throughput of the system. 

10 [0005] Another problem with known software emula- 
tors is the inability to optimize the native code environ- 
ment. Thus there is a need to improve emulation soft- 
ware routines which allows for optimizing of the native 
code environment. 

15 

SUMMARY OF THE INVENTION 

[0006] Briefly, the present invention relates to a meth- 
od and apparatus for emulating instructions of a micro- 
20 processor ("legacy instructions") with an incompatible 
instruction set which provides increased throughput rel- 
ative to known emulation systems. In particular, each 
word of legacy memory is translated into an opcode/op- 
erand field and a dual function vector/tag field. The vec- 
25 tor field represent addresses to legacy instruction emu- 
lation routines. The tag field is indexed to table of "thunk" 
objects, which can be used for various purposes. Such 
purposes include a disabling part of the legacy software, 
augmenting the legacy software with native software, 
30 and gathering execution statistics without modifying the 
legacy software. 

BRIEF DESCRIPTION OF THE DRAWING 

35 [0007] These and other objects of the present inven- 
tion will be readily understood with reference to the fol- 
lowing specification and attached drawing, wherein: 
[0008] FIG. 1 is a block diagram illustrating the trans- 
lation of the legacy instruction in accordance with the 
40 present invention. 

[0009] FIG. 2 is a block diagram illustrating one em- 
bodiment (in which vector translation is done by hard- 
ware) for translating the legacy instructions in accord- 
ance with the present invention. 
45 [0010] FIG. 3 is a block diagram illustrating the map- 
ping of the legacy instructions to main memory. 
[0011] FIG. 4 is a block diagram of an upgrade micro- 
processor with cache memory for illustrating the opera- 
tion of the cache system in accordance with the present 
50 invention. 

[0012] FIG. 5 is a block diagram illustrating both the 
indexing to a table of thunk objects and direct vectoring 
to legacy code emulation routines in accordance with 
the present invention. 
55 [0013] FIG. 6 is a block diagram of the translated leg- 
acy words in accordance with the present invention. 
[0014] FIG. 7 is an exemplary flow chart for a gener- 
alized thunk object in accordance with the present in- 
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vention. 

[0015] FIG. 8 is an exemplary flow chart illustrating 
manipulation of the instruction counter for thunk objects 
which disable or branch around legacy code and enable 
execution of native software for use with the present in- 
vention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0016] The present invention relates to a emulation 
system and method for emulating legacy instructions of 
an outdated ("legacy") microprocessor with a new up- 
grade microprocessor with an incompatible instruction 
set and for handling calls to the legacy environment of 
the legacy microprocessor. In such applications, soft- 
ware programs are written which emulate each of the 
legacy instructions using instructions of the upgrade mi- 
croprocessor. In known emulation systems, the emula- 
tion software causes the legacy instruction to be fetched 
and interpreted by the new microprocessor by way of 
the lookup table. As discussed in more detail below, 
such methodology has a significant impact on the 
throughput of the system. In order to increase the 
throughput of the system, the legacy instructions are 
translated into direct vectors to software routines or em- 
ulation code for emulating the legacy instructions. As 
such, as will be discussed in more detail below, the prob- 
ability of cache misses is greatly reduced which results 
in increased throughput of the system. 
[0017] Two embodiments of the invention are de- 
scribed and illustrated. In particular, FIGS. 1-4 illustrate 
one embodiment in which each instruction of the legacy 
code is translated to a direct vector to a software routine 
for emulating the legacy instruction. FIGS. 5 and 6 illus- 
trate an alternate embodiment in which each word of 
legacy code is translated to opcode/operand field and a 
dual function vector/tag field. The vectors are addresses 
to legacy emulation routines for emulating the legacy in- 
structions. The tags are indexed to a table of thunk ob- 
jects for performing various functions, such as disabling 
a portion of the legacy code, augmenting the legacy 
code with native code as well as various other purposes. 

DIRECT VECTORED LEGACY INSTRUCTION SET 
EMULATION 

[0018] Turning to FIG. 1 , embedded software or code 
for a legacy microprocessor is generally identified with 
the reference numeral 20. Such code is normally stored 
in nonvolatile read only memory (ROM). As shown, the 
ROM 20 includes legacy instructions, identified as IN- 
STR 1 , INSTR 2 and INSTR 3, etc. The ROM 20 also 
includes immediate data. The instructions INSTR 1 , IN- 
STR 2, INSTR 3, etc. plus the immediate data in the 
ROM 20, are located, for example, at a base address A 
within the memory space. 

[0019] In accordance with an important aspect of the 
invention, each instruction (i.e. instruction INSTR 1 , IN- 



STR 2, INSTR 3, etc.) is translated to a direct vector to 
a software routine or emulation code for emulating the 
legacy instruction. For example, each legacy instruction 
is mapped or translated to another memory device 22, 
5 such as a ROM 22. The structure of the memory device 
22 with respect to the instructions, is maintained the 
same as the ROM 20 but at a different base address B. 
In other words, instructions in the ROM 20 are located 
at a base address A plus an instruction counter (IC) off- 
set which corresponds to the next legacy instruction to 
be executed. The instructions, INSTR1, INSTR 2, 
INSTR3, etc. are mapped to the ROM 22 at a different 
base address B but with the same IC offset. 
[0020] The direct vectors in the ROM 22 can either be 
JUMP instructions to software routines for emulating the 
legacy instruction or address pointers. For example, the 
direct vectors can represent an offset pointer to an em- 
ulation microcode routine or a pointer to a table which 
contains a pointer to a microcode routine. Regardless, 
it is these vectors that are fetched by the emulation soft- 
ware rather than the legacy instructions. 
[0021 ] The immediate data may be translated into bo- 
gus vectors in the ROM 22 which are not used by the 
emulation software. Rather, the emulation software in 
accordance with present invention may access the im- 
mediate data directly from the legacy code 20 by reading 
the data directly atthe base address Afrom the ROM 20. 

CONCURRENT LEGACY AND NATIVE CODE 



[0022] Alternatively, each word of legacy software can 
be translated into a opcode/operand field, for example, 
a 16 bit field, and an instruction vector/tag field, for ex- 

35 ample, a 1 6 bit field, as shown in Fig. 6. In an exemplary 
32 bit embodiment, the vector field contains the lower 
1 6 bits of the address of the legacy code emulation rou- 
tine 35 (Fig. 5). The opcode field contains the untrans- 
lated opcode from the legacy software. The opcode may 

40 be located at offset 0-1 bytes while the vector is offset 
2-3 bytes. 

[0023] The instruction vector/tag field provides a dual 
function depending on how it is interpreted. The vector/ 
tag field is interpreted as a vector for instructions and a 

45 tag for operands. As mentioned above, the vector is the 
address in the memory where the legacy code emula- 
tion routine resides. The tags are used to index to a table 
of thunk objects, for example as illustrated in Fig. 5. 
These thunk objects can be used to implement various 

50 services, such as optimizing the native code; disabling 
part of the legacy software; augmenting the legacy soft- 
ware with native software; and debugging and gathering 
execution statistics regarding the legacy software with- 
out modifying the legacy software. The range of values 

55 maybe used to indicate whether the field is a tag or vec- 
tor. For example, values of 1 to MAX_TAG-1 may be 
defined as valid tags. The values of MAX_TAG to Oxffff 
may be used to define vectors. A value 0 (no tag) may 
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be used to indicate that no special operation is to be 
performed. 

[0024] Referring to Fig. 5 the tags are indexed to a 
table of thunk objects 36. This table 36 represents a list 
of addresses of thunk object, such as the objects 38. 
These thunk objects 38 can be used for various purpos- 
es as discussed above. The various thunk objects 38 
may return to the emulation software or to other thunk 
objects. 

[0025] Flow charts for exemplary thunk objects are il- 
lustrated in FIGs. 7 and 8. FIG. 7 represents a flow chart 
for a generalized thunk object while FIG. 8 is an exem- 
plary flow chart for a specific thunk object which illus- 
trates manipulation of the instruction counter in order to 
branch around or disable legacy software and optimally 
execute native instructions in accordance with the 
present invention. As discussed above, the principles of 
the present invention are adapted to be used with virtu- 
ally any number of thunk objects. 
[0026] Referring to FIG. 7, the thunk objects are im- 
plemented in general by saving and storing the legacy 
context, for example, as illustrated by the block 40. The 
legacy context consists of the contents of the general 
purpose registers of the microprocessor including the 
instruction counter and other general purpose registers. 
As such, contents of these general purpose registers 
are directly accessible by the thunk objects. As indicated 
in step 42, the saved legacy context (i.e. contents of the 
general purpose registers including the instruction 
counter) is retrieved. Subsequently, in steps 44 and 46, 
the legacy context is man ipulated to perform a newf unc- 
tion as illustrated in step 46. Exemplary thunk objects 
include: optimizing the native code; disabling part of the 
legacy software; augmenting the legacy software with 
native code; and debugging and gathering execution 
statistics regarding the legacy software without modify- 
ing the legacy software. Other thunk objects are also 
within the broad scope of the present invention. After 
the legacy context is manipulated and processed in 
steps 44 and 46, the legacy context (i.e. general pur- 
pose registers) is updated in step 48. The system re- 
turns to the legacy context in step 50. 
[0027] FIG. 8 relates to a more specific thunk object 
which can be usedfordisabling orbranching around leg- 
acy software. Initially in step 52, the systems saves the 
legacy context. In particular, the context of the general 
purpose registers including the instruction counter reg- 
ister is saved in this step, as discussed above. In step 
54, the system retrieves the pointer to the next instruc- 
tion from the instruction counter register. In order to 
branch around or disable legacy software, a displace- 
ment is added to the pointerfrom the instruction counter 
in step 56. This displacement may be used to either 
branch around legacy software or to disable legacy soft- 
ware. The displacement added to the pointer for the in- 
struction counter enables the system to accomplish ei- 
ther of those functions. After the displacement is added 
to the pointer in the instruction counter, a modified in- 



struction counter value is saved and stored in step 58. 
Subsequently, the system returns to the legacy context 
in step 60 to the new value in the instruction counter 
register. 

5 

IMPLEMENTATION 

[0028] Various methods are suitable for translating 
the legacy microcode. Although the implementation is 

10 discussed only in terms of the embodiment illustrated in 
Fig. 1 , thefollowing is also applicableto the embodiment 
illustrated in Figs. 5 and 6. Both hardware and software 
methods for translating the legacy wards are suitable. 
FIG. 2 illustrates a hardware implementation in which a 

15 hardware device 24, such as a ROM containing a lookup 
table, is coupled to the data bus between an upgrade 
microprocessor 26 and the legacy microcode, i.e. ROM 
20. The hardware device 24 is configured such that at 
any time an access to the legacy code is requested , i. 

20 e. base address B plus IC offset, the vector correspond- 
ing to the requested instruction is provided. Alternately, 
the decoder can be bypassed such that an access to 
the legacy code (i.e. base address A plus IC offset) will 
return the untranslated data. Thus, the upgrade proces- 

25 sor can be directed to the associated emulation code 
routine by the fetched vector, or it can access immediate 
data directly from memory. 

[0029] In an alternate embodiment of the invention, 
the legacy emulation code may be translated by soft- 

30 ware when the legacy memory is loaded into main mem- 
ory or modified. In particular, a software program, for 
example, a portion of the initialization software, is used 
to load the legacy code, into the computer system main 
memory 28 (FIG. 3) (e.g. at base address A). In this im- 

35 plementation, after loading the legacy microcode, the di- 
rect vectors (i.e. ROM 22) are loaded into the main 
memory 28 at another location (e.g. base address B), 
allowing the emulation code in the upgrade processor 
to access either the translated 22 or untranslated 20 leg- 

40 acy memory sections. The vectors retrieved from the 
translated memory 22 are used to point to the individual 
software routines in the emulation code 30. With this 
embodiment of the invention, the translated 22 and un- 
translated 20 legacy memory sections need not be dis- 

45 joint, they might be interleaved, such that each vector 
immediately follows or proceeds the untranslated in- 
struction. 

[0030] As mentioned above, the configuration of the 
method and system for emulating legacy instructions is 

50 adapted to improve the throughput of the system. More 
particularly, many known microprocessors include 
cache memories in order to improve the throughput of 
the system. Software fetched from the main memory, is 
copied into the cache memory, which is much quicker 

55 than main memory. Thus, instructions stored in the 
cache memory can be executed much quicker than 
those stored only in main memory. Such cache memo- 
ries are normally formed from high speed static random 
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access memory (SRAM) and are used to store copies 
of data in the main memory or newly stored data. Such 
cache memories operate on the principles that most pro- 
grams execute instructions in sequence, and, due to 
loops, programs are likely to re-use recently fetched in- 
structions. This principle is call locality . Thus, instead 
of fetching asingle instruction at a time, a cache memory 
system looks ahead and fetches blocks of instructions 
in sequence and stores the instructions for quick ac- 
cess. 

[0031] In operation, all data stored in a cache memory 
is stored with what is known as an address tag. The ad- 
dress tag indicates the physical addresses of the data 
in the main memory that is being stored in the cache. 
Whenever the microprocessor initiates a memory ac- 
cess, the address tags in the cache memory are first 
examined to determine if the particular data requested 
is already stored in the cache memory. When the data 
is found in the cache memory, this is known as a cache 
hit and data is immediately available to the microproc- 
essor. If it is determined that the requested data is not 
in the cache memory, this condition is known as a cache 
miss. As a result of a cache miss, the requested data 
then must be retrieved from the main memory at a much 
slower rate. 

[0032] FIG. 4 illustrates atypical configuration of a mi- 
croprocessor with onboard cache memory. In previous 
known systems, the known emulation software fetched 
the legacy instructions themselves. The legacy instruc- 
tions were then interpreted byway of lookup table. Since 
the cache memory is based on the premise that a se- 
quence of data will be requested in sequential memory 
locations, the use of the lookup table is not particularly 
efficient in cache memory systems and results in a rel- 
atively high probability of cache misses. By utilizing di- 
rect vectors, and because of effects of the locality prin- 
ciple in the legacy code and corresponding direct vec- 
tors, the probability of cache misses is greatly reduced 
thereby increasing the overall throughput of the system. 
More particularly, referring to FIG. 4, a memory system 
in accordance with the present invention is illustrated. 
As shown, the system includes the upgrade microproc- 
essor 26 which includes two onboard cache memories 
32 and 34. One cache memory 32 is used for data, form- 
ing a data cache while the other cache memory 34 is 
used for instructions forming an instruction cache. The 
instruction cache 34 may be used almost exclusively for 
the emulation microcode. The data cache 32 may be 
used for the legacy code. 

[0033] In operation, a group of eight vectors may be 
fetched from main memory upon a cache miss and 
stored in the data cache 32 as part of the cache line refill 
operation. Since legacy instructions normally proceed 
in sequential order, the subsequent 7 requests for in- 
struction vectors will normally be resident in the data 
cache 30. If the next legacy instructions to be executed 
is non-sequential but is within the last one thousand in- 
structions to executed, (i.e. local loops), there is a high 



probability that the vector will still be in the data cache 
30. This invention has reduced the probability of cache 
misses and thus increased the throughput of the sys- 
tem. 

5 [0034] Obviously, many modifications and variations 
of the present invention are possible in light of the above 
teachings. Thus, it is to be understood that, within the 
scope of the appended claims, the invention may be 
practiced otherwise than as specifically described 

10 above. 

[0035] What is claimed and desired to be secured by 
Letters Patent of the United States is: 



15 Claims 

1 . An apparatus for emulating legacy instructions of a 
predetermined microprocessor defining a legacy 
environment, the apparatus comprising: 

20 

means for emulating said legacy instructions, 
said emulating means including a software rou- 
tine for emulating various legacy instructions, 
the address of each software routine being 

25 identified by a vector; 

means for translating said legacy instructions 
to a opcode/operand field and a vector/tag field, 
said vectors representing addresses of soft- 
ware routines for emulating said legacy instruc- 

30 tions and said tags indexed to a table of thunk 

objects; for said software routines for emulating 
said legacy instructions; 
means for fetching said vector/tag field in re- 
sponse to requests for said legacy instructions; 

35 and 

means for executing said software routines or 
said thunk objects in response to said fetching 
means. 

40 2. The apparatus as recited in claim 1 , wherein said 
translating means is hardware based. 

3. The apparatus as recited in claim 2, wherein said 
translating means includes a hardware device cou- 

45 pied between an upgrade microprocessor and said 
predetermined memory device for said legacy in- 
structions. 

4. The apparatus as recited in claim 1 , wherein said 
50 translating means is software based. 

5. A method for emulating legacy instructions with an- 
other microprocessor having an incompatible in- 
struction set, the method comprising the steps of: 

55 

a) storing emulation software routines for emu- 
lating legacy instructions; 

b) translating said legacy instructions to a vec- 
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tor/tag field, said vectors representing address- 
es of software routines for emulating said leg- 
acy instructions and said tags indexed to atable 
of thunk objects; 

c) fetching said vector/tag field in response to 5 
requests for said legacy instructions; and 

d) executing said software routines or said 
thunk objects as a function of said vector/tag 
field. 

10 

6. An apparatus for emulating legacy instructions of a 
predetermined microprocessor defining a legacy 
environment, the apparatus comprising: 

means for emulating legacy instructions, said 15 
emulating means including one or more soft- 
ware routines for emulating one or more legacy 
instructions, the address of each software rou- 
tine being identified by a vector; and 
means for bypassing or disabling a portion of 20 
the said legacy instructions. 

7. The apparatus as recited in claim 6, further includ- 
ing means for executing native code. 

25 

8. A method for emulating legacy instructions with an- 
other microprocessor having an incapable instruc- 
tion set the method comprising the steps of: 

storing one or more emulation software rou- 30 
tines for emulating one or more legacy instruc- 
tion; and 

branching around a portion of said legacy in- 
structions. 

35 

9. As recited in claim 8, wherein said branching results 
in disabling a portion of said legacy software. 

10. The method as recited in claim 8 further including 

the steps of running native software along with said 40 
legacy software. 
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